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3 <110> APPLICANT: Yamanaka, Shinya 

4 Dainippon Sumitomo Pharma Co., Ltd. 

6 <120> TITLE OF INVENTION: Screening method for somatic cell nuclear reprogramming 
substance 

8 <130> FILE REFERENCE: 701049 
C--> 10 <140> CURRENT APPLICATION NUMBER: US/10/589 , 905 
C--> 10 <141> CURRENT FILING DATE: 2006-08-18 

10 <150> PRIOR APPLICATION NUMBER: JP 2004-042337 

11 <151> PRIOR FILING DATE: 2004-02-19 

13 <150> PRIOR APPLICATION NUMBER: JP 2004-232961 

14 <151> PRIOR FILING DATE: 2004-08-10 

16 <150> PRIOR APPLICATION NUMBER: JP 2004-276572 

17 <151> PRIOR FILING DATE: 2004-09-24 
19 <160> NUMBER OF SEQ ID NOS : 50 

21 <170> SOFTWARE: Patentln Ver . 2.1 

23 <210> SEQ ID NO: 1 

24 <211> LENGTH: 1623 

25 <212> TYPE: DNA 

2 6 <213> ORGANISM: Mus musculus 
28 <220> FEATURE: 

2 9 <221> NAME /KEY : CDS 

30 <222> LOCATION: (50).. (1369) 

32 <400> SEQUENCE: 1 

33 tgactgatct tgagtttgca taggcttcct gcggtgaaac gggtacact atg gcc tct 58 

34 Met Ala Ser 

35 1 

37 ctg aag agg ttt cag acg etc gtg ccc ctg gat cac aaa caa ggt acc 106 

3 8 Leu Lys Arg Phe Gin Thr Leu Val Pro Leu Asp His Lys Gin Gly Thr 
39 t 5 10 15 

41 tta ttt gaa att att gga gag ccc aag ttg ccc aag tgg ttc cat gtc 154 

42 Leu Phe Glu lie lie Gly Glu Pro Lys Leu Pro Lys Trp Phe His Val 

43 20 25 30 35 

45 gaa tgc ctg gaa gat cca aaa aga ctg tac gtg gaa cct egg eta ctg 202 

46 Glu Cys Leu Glu Asp Pro Lys Arg Leu Tyr Val Glu Pro Arg Leu Leu 

47 40 45 50 

49 gaa ate atg ttt ggt aag gat gga gag cac ate cca cat ctt gaa tct 250 

50 Glu lie Met Phe Gly Lys Asp Gly Glu His lie Pro His Leu Glu Ser 

51 55 60 65 

53 atg ttg cac acc ctg ata cat gtg aac gtg tgg ggc cct gaa agg cga 2 98 

54 Met Leu His Thr Leu lie His Val Asn Val Trp Gly Pro Glu Arg Arg 

55 70 75 80 

57 get gag att tgg ata ttc gga ccg ccg cct ttc cga agg gac gtt gac 346 

58 Ala Glu lie Trp lie Phe Gly Pro Pro Pro Phe Arg Arg Asp Val Asp 

59 85 90 95 
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61 egg atg etc act gat ctg get cac tat tgc cgc atg aaa ctg atg gaa 394 

62 Arg Met Leu Thr Asp Leu Ala His Tyr Cys Arg Met Lys Leu Met Glu 

63 100 105 110 115 

65 ata gag get ctg gag get gga gtt gag cgt cgt cgt atg gcg gec cat 442 

66 lie Glu Ala Leu Glu Ala Gly Val Glu Arg Arg Arg Met Ala Ala His 

67 120 125 130 

6 9 aag get gee ace cag cct get ccc gtg aag gtc cgc gag get gee cct 4 90 

70 Lys Ala Ala Thr Gin Pro Ala Pro Val Lys Val Arg Glu Ala Ala Pro 

71 135 140 145 

73 egg ccc get tec gtg aag gtc cct gag acg gee acc cag cct get ccc 538 

74 Arg Pro Ala Ser Val Lys Val Pro Glu Thr Ala Thr Gin Pro Ala Pro 

75 150 155 160 

77 gtg aag gtc cgc gag get gee cct cag ccc get ccg gtg cag gag gtc 586 

78 Val Lys Val Arg Glu Ala Ala Pro Gin Pro Ala Pro Val Gin Glu Val 

79 165 170 175 

81 cgc gag get gee cct cag cag get tec gtg cag gag gag gtc cgc gag 634 

82 Arg Glu Ala Ala Pro Gin Gin Ala Ser Val Gin Glu Glu Val Arg Glu 

83 180 185 190 195 

85 get gee acc gag cag get ccc gtg cag gag gtc cgc gag get gee acc 682 

86 Ala Ala Thr Glu Gin Ala Pro Val Gin Glu Val Arg Glu Ala Ala Thr 

87 200 205 210 

89 gag cag get ccc gtg cag gag gtc age gag get gee acc gag cag get 730 

90 Glu Gin Ala Pro Val Gin Glu Val Ser Glu Ala Ala Thr Glu Gin Ala 

91 215 220 225 

93 ccc gtg cag gag gtc aac gag get gee acc gag cag get tec gtg cag 778 

94 Pro Val Gin Glu Val Asn Glu Ala Ala Thr Glu Gin Ala Ser Val Gin 

95 230 235 240 

97 gcg gtc cgc gag get gee acc egg ccg get ccc ggg aag gtc cgc aag 82 6 

98 Ala Val Arg Glu Ala Ala Thr Arg Pro Ala Pro Gly Lys Val Arg Lys 

99 245 250 255 

101 gcg gee acc cag ccg get ccg gtg cag gtt tgc cag gag gee acc cag 874 

102 Ala Ala Thr Gin Pro Ala Pro Val Gin Val Cys Gin Glu Ala Thr Gin 

103 260 265 270 275 

105 ttg get ccc gtg aag gtc cgc gag gcg gee acc cag ccg get tec ggg 922 

106 Leu Ala Pro Val Lys Val Arg Glu Ala Ala Thr Gin Pro Ala Ser Gly 

107 280 285 290 

109 aag gtc cgc gag gcg gee acc cag ttg get cct gtg aag gtc cgc aag 970 

110 Lys Val Arg Glu Ala Ala Thr Gin Leu Ala Pro Val Lys Val Arg Lys 

111 295 300 305 

113 gca gee acc cag ttg get cct gtg aag gtc cac gag gcg gee acc cag 1018 

114 Ala Ala Thr Gin Leu Ala Pro Val Lys Val His Glu Ala Ala Thr Gin 

115 310 315 320 

117 ccg get ccg ggg aag gtc age gat get gee acg cag teg get teg gtg 1066 

118 Pro Ala Pro Gly Lys Val Ser Asp Ala Ala Thr Gin Ser Ala Ser Val 

119 325 330 335 

121 cag gtt cgt gag get gee acg cag ctg tct ccc gtg gag gee act gat 1114 

122 Gin Val Arg Glu Ala Ala Thr Gin Leu Ser Pro Val Glu Ala Thr Asp 

123 340 345 350 355 

125 act age cag ttg get cag gtg aag get gat gaa gee ttt gee cag cac 1162 
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X £. V 


Thr 


Ser 


Gin 


Leu 


r\x a 


m n 


V d _L 


xi y o 


r\x a 




VJ-L U 


Ala 


Php 


Ala 


Gin 


His 




XZ / 










760 










1 ^ 

J D 3 










170 






x *z. y 


act 


tea 


ggg 


gag 


npp 


cac 


cag 


gtt 


gee 


aat 


ggg 


cag 


tct 


ccc 


att 


gaa 


1210 

J- *J J- V/ 




Thr 


Ser Gly Glu 




His 


Gin 


Val 


Ala 


Asn 


Gly 


Gin 


Ser 


Pro 


He 


Glu 




1 J 1 








375 










380 










385 








X J J 


gtc 


tgt 


gag 


act 


gee 


acc 


ggg 


cag 


cat 


tct 


eta 


gat 


gtc 


tct 


agg 


gee 


1258 

-1- A-t u 


J. J ft 


Val 


Cys 


Glu 


Thr 


Ala 


Thr 


Gly 


Gin 


His 


Ser 


Leu 


Asp 


Val 


Ser 


Arg 


Ala 




i -jc 

1 J J 






390 










395 










400 










1-57 


ttg 


tec 


cag 


aag 


tgt 


cct 


gag 


gtt 


ttt 


gag 


tgg 


gag 


acc 


cag 


agt 


tgt 


1 7 06 


TOO 


Leu 


Ser 


Gin 


Lys 


Cys 


Pro 


Glu 


Val 


Phe 


Glu 


Trp 


Glu 


Thr 


Gin 


Ser 


Cys 




i j y 




405 










410 










415 












1 VI 1 

14 1 


ttg 


gat 


ggc 


age 


tat 


gtc 


ata 


gtt 


cag 


cct 


cca 


agg 


gat 


gec 


tgg 


gaa 


IOC/ 


14^ 


Leu Asp Gly Ser 


Tyr 


Val 


He 


Val 


Gin 


Pro 


Pro 


Arg 


Asp 


Ala 


Trp 


Glu 




14i 


420 










425 










430 










435 




14b 


tea 


ttt 


ate 


ata 


tta 


taaatgeate tctggtgtga gecaggatag atggtacacg 


u y 


146 


Ser 


Phe 


He 


He 


Leu 


























t A 1 

14 / 










440 


























149 


tetgeaaate cagaacctaa aggcaggggt 


tagcttgggc 


tgagtaaggc aatgatctta 


14b y 


TCI 


aacctcagcc 1 


tgectaagae tcccttcatc 


: tttctttctg 


gtttttgece 1 


taggaategg 


X D Z. zf 


153 


gaagaacaga gtagagctgt ttttgtttcc ccattgtgtt 


aaatgtttgc agacacaatt 


iron 

lb o y 


ICC 

lob 


taaagtattc 1 


taataaaaaa aaaattgeat 


. tccc 














i £ *? ^ 


158 


<210> SEQ ID NO 


: 2 


























159 


<211> LENGTH: 440 


























160 


<212> TYPE: 


PRT 




























Ibl 


<213> ORGANISM: 


MUS 


musculus 




















163 


<400> SEQUENCE: 


2 


























J-b4 


Met 


Ala 


Ser 


Leu 


Lys 


Arg 


Phe 


Gin 


Thr 


Leu 


Val 


Pro 


Leu 


Asp 


His 


Lys 




165 


1 








5 










10 










15 






16 / 


Gin 


Gly 


Thr 


Leu 


Phe 


Glu 


He 


He 


Gly 


Glu 


Pro 


Lys 


Leu 


Pro 


Lys 


Trp 




168 








20 










25 










30 








1 /U 


Phe 


His 


Val 


Glu 


Cys 


Leu 


Glu 


Asp 


Pro 


Lys 


Arg 


Leu 


Tyr 


Val 


Glu 


Pro 




171 






35 










40 










45 










± / J 


Arg 


Leu 


Leu 


Glu 


He 


Met 


Phe 


Gly 


Lys 


Asp 


Gly 


Glu 


His 


He 


Pro 


His 




1 /4 




50 










55 










60 












1 

J. / D 


Leu 


Glu 


Ser 


Met 


Leu 


His 


Thr 


Leu 


He 


His 


Val 


Asn 


Val 


Trp 


Gly 


Pro 




1 / / 


65 










70 










75 










80 




1 7Q 


Glu 


Arg 


Arg 


Ala 


Glu 


He 


Trp 


He 


Phe 


Gly 


Pro 


Pro 


Pro 


Phe 


Arg 


Arg 




180 










85 










90 










95 






182 


Asp 


Val 


Asp 


Arg 


Met 


Leu 


Thr 


Asp 


Leu 


Ala 


His 


Tyr 


Cys 


Arg 


Met 


Lys 




183 








100 










105 










110 








185 


Leu 


Met 


Glu 


He 


Glu 


Ala 


Leu 


Glu 


Ala 


Gly 


Val 


Glu 


Arg 


Arg 


Arg 


Met 




186 






115 










120 










125 










188 


Ala 


Ala 


His 


Lys 


Ala 


Ala 


Thr 


Gin 


Pro 


Ala 


Pro 


Val 


Lys 


Val 


Arg 


Glu 




189 




130 










135 










140 












191 


Ala 


Ala 


Pro 


Arg 


Pro 


Ala 


Ser 


Val 


Lys 


Val 


Pro 


Glu 


Thr 


Ala 


Thr 


Gin 




192 


145 










150 










155 










160 




194 


Pro 


Ala 


Pro 


Val 


Lys 


Val 


Arg 


Glu 


Ala 


Ala 


Pro 


Gin 


Pro 


Ala 


Pro 


Val 




195 










165 










170 










175 






197 


Gin 


Glu 


Val 


Arg 


Glu 


Ala 


Ala 


Pro 


Gin 


Gin 


Ala 


Ser 


Val 


Gin 


Glu 


Glu 
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198 180 185 190 

2 00 Val Arg Glu Ala Ala Thr Glu Gin Ala Pro Val Gin Glu Val Arg Glu 

201 195 200 205 

2 03 Ala Ala Thr Glu Gin Ala Pro Val Gin Glu Val Ser Glu Ala Ala Thr 

204 210 215 220 

206 Glu Gin Ala Pro Val Gin Glu Val Asn Glu Ala Ala Thr Glu Gin Ala 

207 225 230 235 240 

209 Ser Val Gin Ala Val Arg Glu Ala Ala Thr Arg Pro Ala Pro Gly Lys 

210 245 250 255 

212 Val Arg Lys Ala Ala Thr Gin Pro Ala Pro Val Gin Val Cys Gin Glu 

213 260 265 270 

215 Ala Thr Gin Leu Ala Pro Val Lys Val Arg Glu Ala Ala Thr Gin Pro 

216 275 280 285 

218 Ala Ser Gly Lys Val Arg Glu Ala Ala Thr Gin Leu Ala Pro Val Lys 

219 290 295 300 

221 Val Arg Lys Ala Ala Thr Gin Leu Ala Pro Val Lys Val His Glu Ala 

222 305 310 315 320 

224 Ala Thr Gin Pro Ala Pro Gly Lys Val Ser Asp Ala Ala Thr Gin Ser 

225 325 330 335 

22 7 Ala Ser Val Gin Val Arg Glu Ala Ala Thr Gin Leu Ser Pro Val Glu 
228 340 345 350 

230 Ala Thr Asp Thr Ser Gin Leu Ala Gin Val Lys Ala Asp Glu Ala Phe 

231 355 360 365 

233 Ala Gin His Thr Ser Gly Glu Ala His Gin Val Ala Asn Gly Gin Ser 

234 370 375 380 

23 6 Pro He Glu Val Cys Glu Thr Ala Thr Gly Gin His Ser Leu Asp Val 
237 385 390 395 400 
23 9 Ser Arg Ala Leu Ser Gin Lys Cys Pro Glu Val Phe Glu Trp Glu Thr 
240 405 410 415 

242 Gin Ser Cys Leu Asp Gly Ser Tyr Val He Val Gin Pro Pro Arg Asp 

243 420 425 430 

245 Ala Trp Glu Ser Phe He He Leu 

246 435 440 

250 <210> SEQ ID NO: 3 

251 <211> LENGTH: 1063 

252 <212> TYPE: DNA 

253 <213> ORGANISM: Homo sapiens 

255 <220> FEATURE: 

256 <221> NAME /KEY : CDS 

257 <222> LOCATION: (54).. (704) 

259 <400> SEQUENCE: 3 

260 tcggcctttg ggtttgctgt ggtgtccttg tctcctgcag gaccggccgc age atg 56 

261 Met 

262 1 

264 gac get ccc agg egg ttt ccg acg etc gtg caa ctg atg cag cca aaa 104 

265 Asp Ala Pro Arg Arg Phe Pro Thr Leu Val Gin Leu Met Gin Pro Lys 

266 5 10 15 

268 gca atg cca gtg gag gtg etc ggt cac etc cct aag egg ttc tec tgg 152 

269 Ala Met Pro Val Glu Val Leu Gly His Leu Pro Lys Arg Phe Ser Trp 
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z /o 






20 










Z D 










6 U 










z /z 


ttc 


cac 


tct 


gag 


etc 


ccg 


aag 


^ -a +- 


Ar*i si 

ccg 


aag 


gca 


gec 


/>-» /-T f~\ 

cgc 




g a g 




o no 

z u u 


Z / J 


Phe 


His 


Ser 


Glu 


pne 


lieu 


Liys 




pro 


L»ys 


vai 


vai 


Arg 


iieu 


GlU 


v ai 




274 




35 










A A 
4 U 






















276 


tgg 


ctg 


gtg 


gaa 


aag 


ate 


ttc 


ggc 


tfm *r*~ 

c gg 


ggc 


gg a 


gaa 


M y^W «g 

cgc 


ate 


y*i 

ccg 


cac 


Z *±o 


O *"7 *7 

277 


Trp 


Leu 


Val 


Glu 


Lys 


xi- 


pne 


(jiy 


Arg 


Gly 


Gly 


UlU 


7% -fc^- 

Arg 


lie 


pro 


nlS 




2 78 


50 










55 










60 










ob 




o o r\ 

280 


gtc 


cag 


ggt 


atg 


4- y-i 

tec 


caa 


ate 


ttg 


att 


cac 


gtg 


aat 


cga 


ttg 


gac 


CCu 


O Q C 

z y d 


281 


Val 


Gin Gly Met 


Ser 


Gin 


He 


Leu 


He 


HIS 


Val 


Asn 


Arg 


Leu 


Asp 


Pro 




282 










70 










75 










80 






2 84 


aac 


ggc 


gag 


get 


gag 


ate 


ttg 


gta 


4-4-4- 
tt t 


ggg 


agg 


cct 


tct 


tac 


cag 


gag 


*3 >1 /I 
J 44 


285 


Asn Gly Glu Ala 


Glu 


He 


Leu 


Val 


Pne 


Gly 


Arg 


Pro 


Ser 


Tyr 


Gin 


Glu 




286 








85 










90 










95 








2 88 


gac 


aca 


ate 


aag 


atg 


ate 


atg 


aac 


ctg 


get 


gac 


tat 


cac 


cgc 


cag 


etc 


TOO 

J yz 


-r-"-, x*v 

289 


Asp 


Thr 


He 


Lys 


Met 


He 


Met 


Asn 


Leu 


Ala 


Asp 


Tyr 


His 


Arg 


Gin 


Leu 




290 






100 










105 










110 










292 


cag 


gcg 


aaa 


ggc 


^ _ 

tea 


gga 


aag 


gec 


etc 


gec 


cag 


gat 


gtc 


gee 


act 


cag 


44 0 


293 


Gin 


Ala 


Lys 


Gly 


Ser 


Gly 


Lys 


Ala 


Leu 


Ala 


Gin 


Asp 


Val 


Ala 


Thr 


Gin 




294 




115 










120 










125 












296 


aag 


gec 


gag 


acc 


cag 


egg 


tct 


tea 


ata 


gaa 


gtc 


egg 


gag 


gec 


ggg 


acg 


488 


297 


Lys 


Ala 


Glu 


Thr 


Gin 


Arg 


Ser 


Ser 


He 


Glu 


Val 


Arg 


Glu 


Ala 


Gly 


Thr 




298 


130 










135 










140 










145 




~i f\ n 

300 


cag 


cgt 


teg 


gtg 


gag 


gtc 


egg 


gag 


gec 


ggg 


acc 


cag 


cgt 


teg 


gtg 


gaa 


r -1 f 


301 


Gin Arg 


Ser 


Val 


Glu 


Val 


Arg 


Glu 


Ala 


Gly 


Thr 


Gin 


Arg 


Ser 


Val 


Glu 




302 










150 










155 










160 






304 


gtc 


cag 


gag 


gtc 


ggg 


aca 


cag 


ggt 


tct 


ccg 


gtg 


gag 


gtg 


cag 


gag 


gee 


584 


305 


Val 


Gin 


Glu 


Val 


Gly 


Thr 


Gin 


Gly 


Ser 


Pro 


Val 


Glu 


Val 


Gin 


Glu 


Ala 




306 








165 










170 










175 








308 


ggg 


acc 


cag 


cag 


tct 


etc 


cag 


get 


gee 


aac 


aag 


teg 


ggg 


acc 


cag 


cga 


632 


309 


Gly Thr 


Gin 


Gin 


Ser 


Leu 


Gin 


Ala 


Ala 


Asn 


Lys 


Ser 


Gly 


Thr 


Gin 


Arg 




310 






180 










185 










190 










312 


tec 


ccc 


gaa 


get 


gee 


age 


aag 


gca 


gtg 


acc 


cag 


egg 


ttt 


cgc 


gag 


gat 


680 


313 


Ser 


Pro 


Glu 


Ala 


Ala 


Ser 


Lys 


Ala 


Val 


Thr 


Gin 


Arg 


Phe 


Arg 


Glu 


Asp 




314 




195 










200 










205 












316 


gec 


egg 


gac 


cca 


gtt 


act 


aga 


tta 


tgaaggcatc tcaggccctg gagecagage 


734 


317 


Ala 


Arg Asp 


Pro 


Val 


Thr 


Arg 


Leu 





















318 210 215 

320 cagtcagggg ttaaagtgaa agecegtatt tccgcccaga agctggggtt ggggagagga 794 

322 tgtggatttt ttgttttacc ctttctgttg catggttgca aacacaaact tgagttctaa 854 

324 taaagaattg caaagtggaa gcccgccccc cccctccccc ccgcctccct taagtccagg 914 

326 aagctggggt ggcgaggaag gatgatgtgg attgtttttg ttttacccct tttgttgaat 974 

328 ggttgccaac ccaaacttga gttttaataa ataattgect ttccaaaaaa aaaaaaaaaa 1034 

330 aaaaaaaaaa aaaaaaaaaa aaaaaaaaa 1063 

333 <210> SEQ ID NO: 4 

334 <211> LENGTH: 217 

335 <212> TYPE: PRT 

336 <213> ORGANISM: Homo sapiens 

338 <400> SEQUENCE: 4 

339 Met Asp Ala Pro Arg Arg Phe Pro Thr Leu Val Gin Leu Met Gin Pro 
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